Metadata services for the Parallax storage system

Author

  • Gitika Aggarwal
Abstract

Parallax is a distributed storage system that uses virtualization to provide storage facilities specifically for virtual environments. In Parallax, fragmentation occurs when the block addresses visible to the guest virtual machine are sequentially placed, but the corresponding physical addresses are not. Because of the copy-on-write (CoW) nature of Parallax, as virtual disks are created, cloned, deleted, snapshotted and migrated, some fragmentation of the physical media can occur, potentially incurring seeks even when performing sequential accesses to the virtual disk. As the storage pool ages, performance issues caused by unchecked fragmentation, unreclaimed storage space and duplicate data can become a significant concern. CoW snapshots also introduce sharing semantics between virtual disks and snapshots. The ability to create CoW clones of virtual disks from snapshots of other virtual disks leads to more sharing relationships. As a result, block reclamation and allocation become non-trivial. We have developed utilities for garbage collecting, de-fragmenting free disk space and virtual disks, and reclaiming duplicate read-only blocks in the storage pool managed by Parallax. They work by updating and maintaining the metadata structures related to each virtual disk and its snapshots. They use very coarse-grained locking on the metadata and work at the block level. They operate across the storage pool and are agnostic to the operating systems and file systems used by the virtual machines.
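
The two core ideas in the abstract, fragmentation of a virtual-to-physical block mapping and reclamation of blocks that no disk or snapshot still shares, can be illustrated with a minimal Python sketch. This is not the Parallax implementation: the flat list mapping, the dictionary of roots, and the function names `count_seeks` and `reclaimable_blocks` are all hypothetical simplifications chosen for illustration, and the abstract does not describe the actual metadata structures.

```python
from typing import Dict, Iterable, List, Set


def count_seeks(mapping: List[int]) -> int:
    """Count discontinuities in a virtual-to-physical block mapping.

    mapping[i] is the (hypothetical) physical block backing virtual block i.
    A sequential read of the virtual disk may incur a seek whenever two
    consecutive virtual blocks are not physically adjacent.
    """
    return sum(1 for prev, cur in zip(mapping, mapping[1:]) if cur != prev + 1)


def reclaimable_blocks(allocated: Set[int],
                       roots: Dict[str, Iterable[int]]) -> Set[int]:
    """Return allocated blocks that no live disk or snapshot references.

    roots maps each live virtual disk or snapshot (assumed names) to the
    physical blocks it references. Because CoW snapshots and clones share
    blocks, a block is free only if it is absent from every root.
    """
    live: Set[int] = set()
    for blocks in roots.values():
        live.update(blocks)
    return allocated - live


if __name__ == "__main__":
    # Contiguous layout versus a layout scattered by CoW writes.
    print(count_seeks([10, 11, 12, 13]))
    print(count_seeks([10, 11, 40, 41, 12]))
    # Blocks 2 and 3 are shared between a disk and its snapshot.
    print(reclaimable_blocks({1, 2, 3, 4, 5},
                             {"disk": [1, 2, 3], "snap": [2, 3, 4]}))
```

Under these assumptions, a defragmenter would aim to lower the value returned by `count_seeks`, while a garbage collector would free whatever `reclaimable_blocks` returns; a real system would also need the locking the abstract mentions so that the metadata does not change mid-scan.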

Similar resources

Optimal Scheduling of Coordinated Wind-Pumped Storage-Thermal System Considering Environmental Emission Based on GA Based Heuristic Optimization Algorithm

The integration of renewable wind and pumped storage with thermal power generation allows for dispatch of wind energy by generation companies (GENCOs) interested in participation in energy and ancillary services markets. However, to realize the maximum economic profit, optimal coordination and accounting for reduction in cost for environmental emission is necessary. The goal of this study is to...

A System for Storing, Retrieving, Organizing and Managing Web Services Metadata Using Relational Database*

In this paper we present our system for efficient storage, update, and retrieval of web service metadata documents in a relational database. Initially we investigate all of the existing approaches for efficient storage and retrieval of XML documents in relational databases. As a result we selected a recently proposed structure-centered storage schema named DLN (Dynamic Level Numbering). As there i...

Scalable Performance of the Panasas Parallel File System

The Panasas file system uses parallel and redundant access to object storage devices (OSDs), per-file RAID, distributed metadata management, consistent client caching, file locking services, and internal cluster management to provide a scalable, fault-tolerant, high-performance distributed file system. The clustered design of the storage system and the use of client-driven RAID provide scalable ...

Distributed Data Repository Supporting Ad-Hoc Collaborations

This paper presents the design and implementation of a distributed data repository for Grid environments that supports secure sharing of possibly confidential data by members of ad-hoc created groups. The system is composed of three independent services: a metadata service, a replica locator service and a storage service. The group-based access control is achieved through augmentation of the user cred...

Lattice QCD Data and Metadata Archives at Fermilab and the International Lattice Data Grid

The lattice gauge theory community produces large volumes of data. Because the data produced by completed computations form the basis for future work, the maintenance of archives of existing data and metadata describing the provenance, generation parameters, and derived characteristics of that data is essential not only as a reference, but also as a basis for future work. Development of these a...

Publication date: 2008